CIS-ELEMENT REPORTER CONSTRUCTS AND USES THEREOF


BACKGROUND OF THE INVENTION 

Cross-Reference to Related Applications

This application claims benefit of priority of pending
non-provisional application U.S. Serial No. 09/206,887, filed
December 8, 1998.

Field of the Invention

The present invention relates generally to the field of
molecular biology of transcription. More specifically, the present
invention relates to cis element reporter constructs for measuring
transcription.

Description of the Related Art 

Regulation of the activity of transcription factors is believed
to be one of the most important biochemical controls of signal
transduction, cell proliferation, differentiation, apoptosis,
development, and disease. Studying the regulation of an individual
transcription factor in various specific biological functional systems
has yielded a tremendous amount of information; however, the
information that one can obtain from the current strategies of studying
an individual factor is limited and does not capture the global
functional status of a number of transcription factors. The information
is limited because cross-talk among the different signal transduction
pathways leads to activation of multiple transcription factors.
Therefore, there is a need to provide a tool to study the interaction
and overall effect of various transcription factors.

Green fluorescent protein (GFP) from the jellyfish Aequorea
victoria is a reporter molecule for monitoring gene expression and
protein localization in vivo, in situ and in real time (1-4). GFP
fluoresces bright green upon mere exposure to UV or blue light, unlike
other bioluminescent reporters, which require additional proteins,
substrates, or cofactors to emit light. GFP fluorescence is stable,
species independent and can be monitored noninvasively in living cells.
GFP fluorescence persists in formaldehyde-fixed cells and is well suited
for double-labeling experiments with other fluorescent markers,
including the GFP variant enhanced green fluorescent protein (EGFP,
GFPmut1; 5-6).

EGFP encodes a protein which has a single, red-shifted
excitation peak and fluoresces about 35 times more intensely than
wild type GFP when excited at 488 nm (7), due to an increase in its
extinction coefficient (Em). To ensure maximal mammalian
expression, the coding region of EGFP contains more than 190 silent
base mutations which correspond to human codon-usage preferences
(7). The red-shifted spectrum and increased expression of EGFP make
it ideal for fluorescence microscopy and fluorescence-activated cell
sorting (FACS; 5, 8).

Destabilized EGFP (d2EGFP) contains the PEST domain from
mouse ornithine decarboxylase (MODC), fused to the C-terminus of
EGFP (9). This domain targets EGFP for rapid turnover, effectively
reducing EGFP's half-life to two hours. The introduction of d2EGFP
greatly increases the utility of GFP in studying dynamic cellular events
in vivo.

Another reporter molecule, secreted alkaline phosphatase
(SEAP, 10-11), is a secreted form of human placental alkaline
phosphatase and has been used as a reporter for the analysis of
cis-acting DNA sequences and trans-acting factors. Secreted alkaline
phosphatase is efficiently secreted from cells, and the levels of SEAP
activity in the culture medium are directly proportional to changes in
intracellular concentrations of SEAP mRNA and protein (10, 12).
Secreted alkaline phosphatase differs from native placental alkaline
phosphatase (AP) only in the absence of a membrane-anchoring
domain. The secreted nature of secreted alkaline phosphatase offers
many advantages over other transcription reporter molecules. Most
notably, it is not necessary to prepare cell lysates to assay for the
reporter protein. Because the transfected cells remain intact, the
kinetics of gene expression can be studied easily using the same
cultures by sampling the medium repeatedly; therefore, the set-up of
multiple cultures and the variability among them are essentially
eliminated.

Another important advantage of secreted alkaline
phosphatase as a transcription reporter is that background signals due
to endogenous alkaline phosphatases are nearly absent. Unlike most
endogenous alkaline phosphatases, secreted alkaline phosphatase is
extremely heat-stable and resistant to the inhibitor L-homoarginine.
Therefore, endogenous alkaline phosphatase activity can be eliminated
effectively by preheating the culture medium sample at 65°C and
assaying in the presence of L-homoarginine.

In addition, luciferase is a widely used reporter. It is one of
the most sensitive reporters and, like GFP and SEAP, may be fused to a
transcription factor of interest and used to track the activation of
that transcription factor.

The prior art is deficient in the lack of effective means of
detecting and measuring the activities of a number of transcription
factors. Further, the prior art is deficient in the lack of a construct for
measuring transcription. The present invention fulfills this
long-standing need and desire in the art.



SUMMARY OF THE INVENTION

The present invention is directed to novel cis-acting
reporters for measuring transcription. In one embodiment of the
present invention, there is provided a cis element-reporter construct
comprising a cis element, a reporter gene and a promoter.
Representative cis elements include nuclear factor-κB, heat responsive
element, Myc, p53, nuclear factor of activated T cells, activator protein
1 (AP1), serum responsive element, glucocorticoid responsive element
and cAMP responsive element. Representative reporter genes include
secreted alkaline phosphatase (SEAP), destabilized green fluorescent
protein (d2EGFP) and luciferase (Luc). Representative promoters
include thymidine kinase promoter and gonadotropin α-gene
promoter.

In another embodiment of the present invention, there is
provided a method of generating a cis element-reporter construct by
linking a cis element to a promoter and then fusing the element to a
reporter gene.

In yet another embodiment of the present invention, there
is provided a method of monitoring activation of a transcription factor,
comprising the steps of generating a vector comprising a cis element, a
reporter gene and a promoter, wherein the cis element is a component
of a DNA sequence to which said transcription factor binds to ensure
activity; transfecting a cell line with the vector; and detecting
expression of the reporter gene, wherein expression of the reporter
gene indicates activation of the transcription factor. Representative
cell lines include CHO, HEK 293, Saos2 and HeLa.

In still yet another embodiment of the present invention,
there is provided a method of determining whether a gene is involved
in a signal transduction pathway, comprising the steps of generating a
vector comprising a cis element, a reporter gene and a promoter,
wherein the cis element is a component of a DNA sequence which
affects the expression of the gene; expressing the vector in a cell line,
wherein the cell line is associated with the signal transduction pathway;
and detecting the expression of the reporter gene, wherein expression
of the reporter gene indicates the involvement of the gene in the signal
transduction pathway.

Other and further aspects, features, and advantages of the
present invention will be apparent from the following description of
the presently preferred embodiments of the invention given for the
purpose of disclosure.

BRIEF DESCRIPTION OF THE DRAWINGS 


So that the manner in which the above-recited features,
advantages and objects of the invention, as well as others which will
become clear, are attained and can be understood in detail, more
particular descriptions of the invention briefly summarized above may
be had by reference to certain embodiments thereof which are
illustrated in the appended drawings. These drawings form a part of
the specification. It is to be noted, however, that the appended
drawings illustrate preferred embodiments of the invention and
therefore are not to be considered limiting in their scope.

Figure 1 shows the restriction map of pNF-κB-SEAP.
Figure 2 shows the restriction map of pNF-κB-d2EGFP.
Figure 3 shows the restriction map of pNF-κB-Luc.
Figure 4 shows mediated induction of transcription factors
in HEK 293 cells. Twenty-four hours after transfection, cells were
treated with an inducer for 6 hours. Samples were collected and
measured for SEAP activity. Figure 4A shows TNF-mediated
transcriptional induction. Cells were treated with 100 ng/ml TNF; the
NF-κB and NFAT signaling pathways exhibit 4-fold and 10-fold induction,
respectively, when activated by TNF. Figure 4B shows serum-mediated
induction of transcription factors in HEK 293 cells. Cells were fed with
10% serum. Multiple pathways were induced when activated by
serum. In both experiments, pTK-SEAP was used as the negative
control. (RLU = relative light units)



DETAILED DESCRIPTION OF THE INVENTION

The following terms have the definitions set forth below.

As used herein, "cis-element" means a genetic element
having the ability to affect the activity of a gene on the same DNA
molecule as that genetic element.

As used herein, "reporter" means a molecule (usually a
protein) that is expressed in response to or as a result of a particular
biological or molecular event.

As used herein, "cis-acting reporter" means a protein that is
expressed in a measurable manner in response to a molecular event
that takes place on the same DNA strand on which the DNA coding for
the protein is located.

As used herein, "transcription factor" means any factor that
controls genetic transcription.

As used herein, "signal transduction pathway" means the
pathway or sequence of molecular events which converts an
extracellular signal into cellular activity.

The present invention is directed to a series of cis element- 
reporter constructs and uses thereof. 

In one embodiment of the present invention, there is
provided a cis element-reporter construct comprising a cis element, a
reporter gene and a promoter. Representative cis elements include
nuclear factor-κB, heat responsive element, Myc, p53, nuclear factor of
activated T cells, activator protein 1, serum responsive element,
glucocorticoid responsive element and cAMP responsive element. In
one preferred embodiment, the nuclear factor-κB has the sequence
shown in SEQ ID No: 1. In one preferred embodiment, the heat
responsive element has the sequence shown in SEQ ID No: 2. In one
preferred embodiment, the Myc has the sequence shown in SEQ ID No:
3. In one preferred embodiment, the p53 has the sequence shown in
SEQ ID No: 4. In one preferred embodiment, the nuclear factor of
activated T cells has the sequence shown in SEQ ID No: 5. In one
preferred embodiment, the activator protein 1 has the sequence shown
in SEQ ID No: 6. In one preferred embodiment, the serum responsive
element has the sequence shown in SEQ ID No: 7. In one preferred
embodiment, the glucocorticoid responsive element has the sequence
shown in SEQ ID No: 8. In one preferred embodiment, the cAMP
responsive element has the sequence shown in SEQ ID No: 9 or SEQ ID
No: 10. Representative reporter genes include secreted alkaline
phosphatase (SEAP), destabilized green fluorescent protein (d2EGFP)
and luciferase (Luc). Representative promoters include thymidine
kinase promoter, gonadotropin α-gene promoter, and others known to
those having ordinary skill in this art.

In another embodiment of the present invention, there is
provided a method of generating a cis element-reporter construct by
linking a cis element to a promoter and then fusing the element to a
reporter gene.
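
By way of illustration only, the arrangement of a cis element, a promoter and a reporter gene in such a construct may be sketched as a simple data model. The following Python sketch is not part of the disclosed methods. The class name ReporterConstruct, its fields, the 5' to 3' ordering and the "p<element>-<reporter>" naming pattern (inferred from vector names such as pNF-kB-SEAP) are assumptions made for the example.

```python
from dataclasses import dataclass

# Representative components named in the description.
CIS_ELEMENTS = {"NF-kB", "HRE", "Myc", "p53", "NFAT", "AP1", "SRE", "GRE", "CRE"}
PROMOTERS = {"thymidine kinase", "gonadotropin alpha-gene"}
REPORTERS = {"SEAP", "d2EGFP", "Luc"}


@dataclass(frozen=True)
class ReporterConstruct:
    """Hypothetical record of one cis element-reporter construct."""

    cis_element: str
    promoter: str
    reporter: str

    def __post_init__(self):
        # Reject components outside the representative lists.
        if self.cis_element not in CIS_ELEMENTS:
            raise ValueError(f"unknown cis element: {self.cis_element}")
        if self.promoter not in PROMOTERS:
            raise ValueError(f"unknown promoter: {self.promoter}")
        if self.reporter not in REPORTERS:
            raise ValueError(f"unknown reporter: {self.reporter}")

    @property
    def name(self) -> str:
        # Assumed naming pattern, e.g. "pNF-kB-SEAP".
        return f"p{self.cis_element}-{self.reporter}"

    def layout(self) -> list:
        # Assumed 5' to 3' order: cis element, promoter, reporter gene.
        return [
            self.cis_element,
            f"{self.promoter} promoter",
            f"{self.reporter} gene",
        ]


construct = ReporterConstruct("NF-kB", "thymidine kinase", "SEAP")
print(construct.name)
print(" -> ".join(construct.layout()))
```

A record of this kind only documents which components were combined; it says nothing about cloning strategy or restriction sites.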

In yet another embodiment of the present invention, there
is provided a method of monitoring activation of a transcription factor,
comprising the steps of generating a vector comprising a cis element, a
reporter gene and a promoter, wherein the cis element is a component
of a DNA sequence to which said transcription factor binds to ensure
activity; transfecting a cell line with the vector; and detecting
expression of the reporter gene, wherein expression of the reporter
gene indicates activation of the transcription factor. Representative
cell lines include CHO, HEK 293, Saos2 and HeLa.

In still yet another embodiment of the present invention,
there is provided a method of determining whether a gene is involved
in a signal transduction pathway, comprising the steps of generating a
vector comprising a cis element, a reporter gene and a promoter,
wherein the cis element is a component of a DNA sequence which
affects the expression of a test gene; expressing the vector in a cell
line, wherein the cell line is associated with the signal transduction
pathway; and detecting the expression of the reporter gene, wherein
expression of the reporter gene indicates the involvement of the gene
in the signal transduction pathway.

The following examples are given for the purpose of 
illustrating various embodiments of the invention and are not meant to 
limit the present invention in any fashion. 
Cis-Elements

The following cis elements were utilized for constructing
cis-acting reporters:

NF-κB (nuclear factor-κB) having the sequence of
GGGAATTTCCGGGAATTTCCGGGAATTTCCGGGAATTTCC (SEQ ID No: 1);

HRE (heat responsive element) having the sequence of
TAGAATGTTCTAGATCTAGAACATTCTAGCTAGAATGTTCTAG (SEQ ID No: 2);

Myc having the sequence of
CACGTGCACGTGCACGTGCACGTGCACGTGCACGTG (SEQ ID No: 3);

p53 having the sequence of TCGAGCTTGCCTGGACTTGCCTG
GGCCAGATCTGTCGACGGAGGTCGAGCTTGCCTGGACTTGCCTGCCCACATCT
GTCGACGGAGGTCGAGCTTGCCTGGACTTGCCTGCCCACATCTGTCGACGGAG
GTCGAGCCTTGCCTGGACTTGCCTGCCA (SEQ ID No: 4);

NFAT (nuclear factor of activated T cells) having the
sequence of GGAGGAAAAACTGTTTCATACAGAAGGCGTGGAGGAAAAA
CTGTTTCATACAGAAGGCGTGGAGGAAAAACTGTTTCATACAGAAGGCGT
(SEQ ID No: 5);

AP1 (activator protein 1) is an active protein whose activity
is attributable primarily to Jun/Fos heterodimers as well as Jun
homodimers. The activity of AP1 responds to growth factors and to
stimuli such as phorbol ester (TPA) and protein kinase C. AP1 having the
sequence of TGAGTCAGTGAGTCACTGACTCACTGACTCATGAGTCAGCTGACTC
(SEQ ID No: 6);

SRE (serum responsive element) having the sequence of
GATGTCCATATTAGGACATCGATGTCCGAATATGGACATCGATGTCCATATTAGGACATC
(SEQ ID No: 7);

GRE (glucocorticoid responsive element) having the
sequence of GGTACATTTTGTTCTAGAACAAAATGTACCGGTACATTTTGTTCT
(SEQ ID No: 8);

CRE stands for cAMP responsive element. Two CRE elements
were utilized: CRE3 (GCACCAGACAGTGACGTCAGCTGCCAGAT
CCCATGGCCGTCATACTGTGACGTCTTTCAGACACCCCATTGACGTCAATGGG
AGAAC; SEQ ID No: 9) and CRE5 (TCGAGCCCATGGCCGTCATACTGTGACG
TCCCCATGGCCGTCATACTGTGACGTCCCCATGGCCGTCATACTGTGACGTCC
CCATGGTCGTCATACTGTGACGTCCCCATGGCCGTCATACTGTGACGTC; SEQ
ID No: 10).



EXAMPLE 2

Cis-Element/SEAP Constructs

A number of cis elements were linked to a secreted alkaline
phosphatase reporter gene. One advantage of using SEAP as a reporter is
that SEAP is secreted into the medium and the medium itself can be
collected for the assay without disrupting the cells.

In an AP1-SEAP construct, the cis element in the construct
contains six copies of the AP1 binding sequence. In an SRE-SEAP
construct, the construct contains three copies of the SRE element. In a
CRE-SEAP construct, the construct contains three different cAMP
responsive elements, responding to cAMP, protein kinase A, forskolin,
etc. In a GRE-SEAP construct, the construct contains three copies of the
glucocorticoid responsive element and responds to the
glucocorticoid-activated glucocorticoid receptor. In an HRE-SEAP
construct, the construct contains three copies of the heat responsive
element. It responds to a number of types of physiological stress, such
as heat. In an NF-κB-SEAP construct, the construct contains four copies
of the NF-κB binding sequence and responds to a number of stimuli,
including TNF. In an NFAT-SEAP construct, the construct contains three
copies of the NFAT binding sequence and responds to PMA, ionomycin, PKC,
etc. In a Myc-SEAP construct, the construct contains six copies of the
Myc binding sequence and responds to Myc activation. In a p53-SEAP
construct, the construct contains two copies of the p53 binding
sequence, reporting p53 activation.
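
The copy numbers stated above can be checked against the sequences listed for the cis elements. The following Python sketch counts tandem copies for three of the elements. The repeat units are assumptions obtained by inspection of SEQ ID Nos: 1, 3 and 5; they are not defined in the text, and the helper name count_copies is hypothetical.

```python
# Cis-element sequences as listed for SEQ ID Nos: 1, 3 and 5.
SEQUENCES = {
    "NF-kB": "GGGAATTTCCGGGAATTTCCGGGAATTTCCGGGAATTTCC",
    "Myc": "CACGTGCACGTGCACGTGCACGTGCACGTGCACGTG",
    "NFAT": (
        "GGAGGAAAAACTGTTTCATACAGAAGGCGT"
        "GGAGGAAAAACTGTTTCATACAGAAGGCGT"
        "GGAGGAAAAACTGTTTCATACAGAAGGCGT"
    ),
}

# ASSUMED repeat units, read off the sequences by inspection.
REPEAT_UNITS = {
    "NF-kB": "GGGAATTTCC",
    "Myc": "CACGTG",
    "NFAT": "GGAGGAAAAACTGTTTCATACAGAAGGCGT",
}


def count_copies(sequence: str, unit: str) -> int:
    """Count non-overlapping occurrences of a repeat unit in a sequence."""
    return sequence.upper().count(unit.upper())


copy_numbers = {
    name: count_copies(seq, REPEAT_UNITS[name]) for name, seq in SEQUENCES.items()
}
for name, copies in copy_numbers.items():
    print(f"{name}: {copies} copies")
```

Under these assumed repeat units the counts agree with the four, six and three copies stated for the NF-kB, Myc and NFAT constructs, respectively.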



pNF-κB-SEAP Construct

pNF-κB-SEAP (SEQ ID No: 11) is especially designed for
monitoring the activation of the NF-κB signal transduction pathway
(13-16). The restriction map of pNF-κB-SEAP is shown in Figure 1.
Induction of the pathway enables endogenous NF-κB to bind to the
kappa (κ) enhancer element (κB4) located in the promoter region of
the vector, thus activating transcription of the reporter gene.

pNF-κB-SEAP allows expression of the secreted alkaline
phosphatase reporter gene (17-19). The secreted alkaline phosphatase
reporter coding sequence is followed by the SV40 late polyadenylation
signal to ensure proper, efficient processing of the secreted alkaline
phosphatase reporter transcript in eukaryotic cells. A synthetic
transcription blocker (TB), composed of adjacent polyadenylation and
transcription pause sites located upstream of the κ enhancer, reduces
background transcription (20). The vector backbone also contains an
f1 origin for single-stranded DNA production, a pUC origin of
replication, and an ampicillin resistance gene for propagation and
selection in E. coli. The SEAP vectors incorporate a number of
additional features that improve the sensitivity of the secreted alkaline
phosphatase reporter by increasing the efficiency of secreted alkaline
phosphatase reporter expression or that enhance the utility of the
vectors. These include: the removal of the SV40 small-t intron, which
can cause cryptic splicing and reduced expression in some genes
and/or cell types (21-22); switching from the early to late
polyadenylation signal of SV40, which typically causes a five-fold
increase in mRNA levels (23); compact plasmid size; and removal of
extraneous sequence from the 3' untranslated region of the SEAP
mRNA.

WO 00/34435 PCT/US99/28451

EXAMPLE 4

Cis-Element/EGFP Construct

EGFP was also used as a reporter to analyze transcription
factors. Destabilized EGFP (d2EGFP) is an autofluorescent protein
whose fluorescence emission does not require any additional
substrates or cofactors. d2EGFP is an ideal reporter to be
used in cell-based assays for high throughput drug screening as it
allows real-time monitoring of activation of transcription factors.
Detection can be made at multiple time points.


A set of d2EGFP reporters with different cis-elements were 
generated, which are used for monitoring different transcription 
factors. Establishment of stable cell lines that express individual 
reporters expands application of this cis-element reporter in the cell- 
based high throughput drug screening. 

An NF-κB-d2EGFP construct also is used to monitor the
activation of NF-κB. This construct responds to a number of stimuli,
including TNF and IL1. The AP1-d2EGFP construct is used to monitor
the activation of AP1, which responds to growth factors. Additionally,
an SRE-d2EGFP construct is used to monitor the activation of Elk1. This
construct responds to serum. The GRE-d2EGFP construct was used to
monitor the activation of glucocorticoid receptors and responds to
glucocorticoid activation. The CRE-d2EGFP construct was used to
monitor CREB activation and responds to cAMP and cAMP-mediated
protein kinase A (PKA). An NFAT-d2EGFP construct was used to
monitor Ca++-mediated NFAT activation, which responds to Ca++. The
HSE-d2EGFP construct is used to monitor HSF activation and responds
to a stress, such as heat. The p53-d2EGFP construct is used to monitor
p53 activation, responding to DNA damage. The Myc-d2EGFP construct
is used to monitor Myc activation, which responds to cell growth.

To construct the cis-acting d2EGFP reporters, the binding
sequence of each cis element was fused to the thymidine kinase (TK)
promoter and the d2EGFP gene. The resulting constructs are able to
monitor the activation of a number of transcription factors. In
CRE-d2EGFP transfected CHO cells, forskolin treatment elevates
intracellular cAMP, resulting in induction of d2EGFP. In NFAT-d2EGFP
transfected HEK 293 cells, ionomycin and PMA treatment activates NFAT,
resulting in induction of d2EGFP. In GRE-d2EGFP transfected Saos2 cells,
dexamethasone treatment activates the glucocorticoid receptor (GR),
resulting in induction of d2EGFP.

EXAMPLE 5 

pNF-κB-d2EGFP Construct

By linking NF-κB binding sequences and the thymidine
kinase promoter to the d2EGFP gene, TNF-mediated activation of NF-κB
could be detected. pNF-κB-d2EGFP (SEQ ID No: 12) was designed to
monitor the activation of the NF-κB signal transduction pathway
(13-16). The restriction map of pNF-κB-d2EGFP is shown in Figure 2.
Induction of the pathway enables endogenous NF-κB to bind to the
kappa (κ) enhancer element (κB4) located in the promoter region of
the vector, thus activating transcription of the reporter gene.

d2EGFP is a destabilized variant of the enhanced green
fluorescent protein. Unlike the original EGFP protein, which is
extremely stable, d2EGFP has a half-life of approximately two hours in
vivo (24). Therefore, d2EGFP provides a more accurate measurement
of transient activation by NF-κB than unmodified EGFP and is ideal for
kinetic studies of gene activation. To construct d2EGFP, residues
422-461 of mouse ornithine decarboxylase (MODC) were fused to the
C-terminus of EGFP. This region of the mouse ornithine decarboxylase
contains a PEST amino acid sequence that targets the protein for
degradation, resulting in rapid protein turnover. pNF-κB-d2EGFP allows
for real-time analysis of NF-κB activation by either fluorescence
microscopy or flow cytometry.
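
As a worked illustration of why a two-hour half-life favors kinetic studies, the sketch below estimates the fraction of d2EGFP remaining once synthesis stops. It assumes simple first-order decay with no new synthesis, which is a simplification not stated in the text. The function name fraction_remaining is hypothetical.

```python
def fraction_remaining(hours: float, half_life_hours: float = 2.0) -> float:
    """Fraction of reporter protein left after `hours`.

    ASSUMPTION: first-order decay and no new synthesis.
    The 2-hour default is the approximate d2EGFP half-life given above.
    """
    if hours < 0 or half_life_hours <= 0:
        raise ValueError("hours must be >= 0 and half_life_hours must be > 0")
    return 0.5 ** (hours / half_life_hours)


for t in (0, 2, 4, 6):
    print(f"{t} h: {fraction_remaining(t):.3f} of the initial d2EGFP remains")
```

Under this assumption about one eighth of the signal remains six hours after transcription ceases, so a decline in reporter fluorescence tracks a decline in promoter activity far more closely than it would for a stable reporter.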

The d2EGFP coding sequence is followed by the SV40 late
polyadenylation signal to ensure proper, efficient processing of the
d2EGFP transcript in eukaryotic cells. A synthetic transcription blocker
(TB), composed of adjacent polyadenylation and transcription pause
sites located upstream of the κ enhancer, reduces background
transcription (20). The vector backbone also contains an f1 origin for
single-stranded DNA production, a pUC origin of replication, and an
ampicillin resistance gene for propagation and selection in E. coli. The
d2EGFP vectors incorporate a number of features that improve the
sensitivity of d2EGFP by increasing the efficiency of d2EGFP expression
or that enhance the utility of the vectors. These include: the removal
of the SV40 small-t intron, which can cause cryptic splicing and
reduced expression in some genes and/or cell types (21-22); switching
from the early to late polyadenylation signal of SV40, which typically
causes a five-fold increase in mRNA levels (23); compact plasmid size;
and removal of extraneous sequence from the 3' untranslated region of
the d2EGFP mRNA.

EXAMPLE 6

pCRE5-d2EGFP Construct

pCRE5-d2EGFP (SEQ ID No: 13) was constructed using
pTK-SEAP as the backbone. The thymidine kinase promoter was
replaced by 5 copies of CREs (cAMP response elements) fused with the
gonadotropin α-gene promoter (also called thyroid-stimulating
hormone TSHα subunit). The d2EGFP reporter gene was used instead of
secreted alkaline phosphatase. This construct was designed to monitor
changes in cAMP level in vivo. cAMP elevation within cells upon
stimulation should result in the phosphorylation of CREB, which
consequently induces the transcription and translation of the reporter
gene d2EGFP. The destabilized form of EGFP eliminated the
accumulation of EGFP in cells and better represented the changes of its
expression level upon stimulation.

The stable cell line was generated by stably co-transfecting
pCRE5-d2EGFP together with pSV2-neo into CHO-K1 cells. Transfected
cells were selected under G418 at a 500 ng/ml concentration and
individual clones were isolated and analyzed for forskolin induction.
Three cell clones were selected which show a time course as well as a
positive response to forskolin. Clone CRE5-dE #3 has the highest
fluorescence intensity and is the easiest in which to detect induction,
since it had a five-fold increase of fluorescence after induction. Clone
CRE5-dE #38 has the highest fold of induction (10-fold) and the lowest
background EGFP expression. However, after induction, clone CRE5-dE
#38 had a lower fluorescence intensity than that of clone #3. Clone
#38-2 is intermediate between the above two clones, having an 8-fold
fluorescence induction and, after induction, a fluorescence intensity
at a level similar to that of clone #3. These cell clones can be used in
cell-based high-throughput screening in search of factors involved in
the cAMP signal transduction pathway.
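
The comparison of the three clones can be expressed as a simple ranking. The Python sketch below orders the clones by the fold induction values reported above. It is illustrative only: the full name "CRE5-dE #38-2" for the third clone and the function name rank_by_fold_induction are assumptions, and the ranking ignores absolute fluorescence intensity, for which no numerical values are given.

```python
# Fold induction after forskolin treatment, as reported above for each clone.
# The label "CRE5-dE #38-2" is an ASSUMED full name for the third clone.
FOLD_INDUCTION = {
    "CRE5-dE #3": 5,
    "CRE5-dE #38": 10,
    "CRE5-dE #38-2": 8,
}


def rank_by_fold_induction(folds: dict) -> list:
    """Return clone names sorted from highest to lowest fold induction."""
    return sorted(folds, key=folds.get, reverse=True)


ranking = rank_by_fold_induction(FOLD_INDUCTION)
for position, clone in enumerate(ranking, start=1):
    print(f"{position}. {clone}: {FOLD_INDUCTION[clone]}-fold")
```

Fold induction is only one criterion. A screen that depends on a bright induced signal might instead favor clone #3 or clone #38-2.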

EXAMPLE 7 

Cis-Element/Luciferase Construct

Luciferase was also used as a reporter to analyze
transcription factors. Luciferase is one of the most sensitive reporters
and is widely used in the art. Due to its sensitivity and popularity,
luciferase was selected to link to a cis element to form a
cis-element/luciferase construct for monitoring the activation of
transcription factors.

pNF-κB-Luciferase (pNF-κB-Luc, SEQ ID No: 14) contains a
firefly luciferase gene and is designed for monitoring the activation of
the NF-κB signal transduction pathway (13-16). The restriction map of
pNF-κB-Luc is shown in Figure 3. Induction of the pathway enables
endogenous NF-κB to bind to the kappa (κ) enhancer element (κB4)
located in the promoter region of the vector, thus activating
transcription of the reporter gene. The luciferase coding sequence is
followed by the SV40 late polyadenylation signal to ensure proper,
efficient processing of the luciferase transcript in eukaryotic cells. A
synthetic transcription blocker, composed of adjacent polyadenylation
and transcription pause sites located upstream of the κ enhancer,
reduces background transcription (20). The vector backbone also
contains an f1 origin for single-stranded DNA production, a pUC origin
of replication, and an ampicillin resistance gene for propagation and
selection in E. coli.

The pNF-κB vectors incorporate a number of features that
improve the sensitivity of luciferase by increasing the efficiency of
luciferase expression or that enhance the utility of the vectors. These
include: the removal of the SV40 small-t intron, which can cause
cryptic splicing and reduced expression in some genes and/or cell
types (21-22); switching from the early to late polyadenylation signal
of SV40, which typically causes a five-fold increase in mRNA levels
(23); compact plasmid size; and removal of extraneous sequence from
the 3' untranslated region of the luciferase mRNA.



EXAMPLE 8

Application of Cis-Acting Reporters in Assaying the Activity of
Transcription Factors in Signal Pathways

The activity assay of the present invention may be carried
out in a 12- or 24-well plate. This assay can be applied to many
studies, such as examining whether a new target gene is linked to any
signal pathway. The assay can be used for establishing a functional
status profile of transcription factors for evaluating any biological
difference in vivo, such as normal cells versus differentiated,
apoptotic, and cancer cells. The assays of the present invention can
also be used to examine and quickly identify biological markers for
human diseases. Drug validation is another application of the present
assay.

Figure 4 demonstrates mediated induction of transcription
factors in HEK 293 cells. Figure 4A shows tumor necrosis factor
(TNF)-mediated transcriptional induction. Human embryonic kidney (HEK
293) cells were transfected individually with each reporter vector and
incubated for 24 hr in serum-free media. After incubation with TNF for
6 hr, samples were collected and measured for SEAP activity. The
transcriptional profile reveals that, as expected, TNF activated the
NF-κB and the NFAT signaling pathways, which are known to play pivotal
roles in the transcription of genes involved in the immune response
(1-4). Other transcription factors and controls were unaffected by TNF
stimulation.

Figure 4B shows serum-mediated induction of transcription
factors in HEK 293 cells. After transfection, cells were cultured in
serum-free media for 24 hr and incubated with 10% serum for 6 hr. Samples
were collected and measured for SEAP activity as described above.
Unlike the TNF-mediated activation of specific pathways (Figure 4A),
serum activates multiple transcription factors that induce a variety of
signaling events after stimulation.
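
The fold induction values discussed for Figure 4 are ratios of induced to uninduced reporter activity. The Python sketch below shows one way such a profile might be computed. It is a minimal illustration, not the procedure used to generate Figure 4: the SEAP readings are hypothetical placeholder numbers chosen only so that the ratios match the 4-fold and 10-fold values reported for NF-kB and NFAT, and the vector label "pNFAT-SEAP" and the function name fold_induction are assumptions.

```python
def fold_induction(induced: float, uninduced: float) -> float:
    """Ratio of reporter activity with inducer to activity without inducer."""
    if uninduced <= 0:
        raise ValueError("uninduced reading must be positive")
    return induced / uninduced


# HYPOTHETICAL (uninduced, induced) SEAP readings in RLU; not measured data.
readings = {
    "pNF-kB-SEAP": (100.0, 400.0),
    "pNFAT-SEAP": (100.0, 1000.0),
    "pTK-SEAP": (100.0, 100.0),  # negative control, expected to stay near 1
}

folds = {
    vector: fold_induction(induced, uninduced)
    for vector, (uninduced, induced) in readings.items()
}
for vector, fold in folds.items():
    print(f"{vector}: {fold:.1f}-fold")
```

Comparing each reporter against the pTK-SEAP negative control helps distinguish pathway-specific induction from a general change in reporter expression.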

The following references were cited herein.

1. Chalfie, et al., (1994) Science 263: 802-805.
2. Prasher, et al., (1992) Gene 111: 229-233.
3. Inouye, et al., (1994) FEBS Letters 341: 277-280.
4. Wang, et al., (1994) Nature 369: 400-403.
5. Yang, et al., (1996) Nucleic Acid Res. 24(22): 4592-4593.
6. Cormack, et al., (1996) Gene 173: 33-38.
7. Hass, et al., (1996) Curr. Biol. 6: 315-324.
8. Galbraith, et al., (1995) Methods Cell Biol. 50: 1-12.
9. Living Colors Destabilized EGFP Vectors (April 1998),
CLONTECHniques XIII(2): 16-17.
10. Berger, et al., (1998) Gene 66: 1-10.
11. Kain, et al., (1997) Methods Mol. Biol. 63: 49-60.
12. Cullen, et al., (1992) Methods Enzymol. 216: 362-368.
13. Baeuerle, P.A. & Baltimore, D. (1996) Cell 87: 13-20.
14. Baeuerle, P.A. (1998) Curr. Biol. 8: R19-R2
15. Peltz, G. (1997) Curr. Opin. Biotechnol. 8: 467-473.
16. Israel, A. (1997) Nature 388: 519-521.
17. Berger, J. et al. (1998) Gene 66: 1-10.
18. Kain, S.R. (1997) Methods Mol. Biol. 63: 49-60.
19. Cullen, B.R. & Malim, M.H. (1992) Methods Enzymol. 216: 362-368.
20. Eggermont, J. & Proudfoot, N. (1993) EMBO J. 12: 2539-2548.
21. Huang, M.T.F. & Gorman, C.M. (1990) Mol. Cell. Biol. 10: 1805-1810.
22. Evans, M.J. & Scarpulla, R.C. (1989) Gene 84: 135-142.
23. Carswell, S. & Alwine, J.C. (1989) Mol. Cell. Biol. 9: 5248-4258.

Any patents or publications mentioned in this specification
are indicative of the levels of those skilled in the art to which the
invention pertains. These patents and publications are herein
incorporated by reference to the same extent as if each individual
publication was specifically and individually indicated to be
incorporated by reference.

One skilled in the art will readily appreciate that the 

20 present invention is well adapted to carry out the objects and obtain 
the ends and advantages mentioned, as well as those inherent therein. 
The present examples along with the methods, procedures, treatments, 
molecules, and specific compounds described herein are presently 
representative of preferred embodiments, are exemplary, and are not 

25 intended as limitations on the scope of the invention. Changes therein 
and other uses will occur to those skilled in the art which are 
encompassed within the spirit of the invention as defined by the scope 
of the claims. 

WO 00/34435 PCT/US99/28451

WHAT IS CLAIMED IS:

1. A construct for measuring transcription, comprising:

a cis element, wherein said cis element is selected from the
group consisting of nuclear factor-κB having the sequence shown in
SEQ ID No: 1, heat responsive element having the sequence shown in
SEQ ID No: 2, Myc having the sequence shown in SEQ ID No: 3, p53
having the sequence shown in SEQ ID No: 4, nuclear factor of activated
T cells having the sequence shown in SEQ ID No: 5, alkaline
phosphatase 1 having the sequence shown in SEQ ID No: 6, serum
responsive element having the sequence shown in SEQ ID No: 7,
glucocorticoid responsive element having the sequence shown in SEQ
ID No: 8 and cAMP responsive element having the sequence selected
from the group consisting of SEQ ID No: 9 and SEQ ID No: 10;

a reporter gene; and

a promoter.

2. The construct of claim 1, wherein said reporter gene
is selected from the group consisting of secreted alkaline phosphatase,
destabilized green fluorescent protein and luciferase.

3. The construct of claim 1, wherein said promoter is
selected from the group consisting of thymidine kinase promoter and
gonadotropin α-gene promoter.

4. A vector comprising the construct of claim 1.

5. The vector of claim 4, wherein said vector is pNF-κB-SEAP.

6. The vector of claim 4, wherein said vector is pNF-κB-d2EGFP.

7. The vector of claim 4, wherein said vector is pNF-κB-Luc.

8. A method of monitoring activation of a transcription
factor, comprising the steps of:

generating the construct of claim 1, wherein said cis-element
is a transcription factor;

transforming a cell line with a vector containing said
construct; and

detecting the expression of said reporter gene, wherein
expression of said reporter gene indicates the activation of said
transcription factor.

9. A method of monitoring activation of a transcription
factor, comprising the steps of:

generating the construct of claim 2;

transforming a cell line with a vector containing said
construct; and

detecting the expression of said reporter gene, wherein
expression of said reporter gene indicates the activation of said
transcription factor.

10. The method of claim 9, wherein said cell line is 
selected from the group consisting of CHO, HEK 293 and Saos2. 



11. The method of claim 9, wherein said construct is
selected from the group consisting of pNF-κB-SEAP, pNF-κB-d2EGFP and
pNF-κB-Luc.

SEQUENCE LISTING

<110> Li, Xianqiang
Zhao, Xiaoning
Fang, Yu
Jiang, Xin
Duong, Tommy
Kain, Steve R.

<120> Cis-Element Reporter Constructs and Uses Thereof 

<130> D6170PCT 

<141> 1999-12-01 

<150> US 09/206,887 

<151> 1998-12-08 

<160> 14 



<210> 1 

<211> 40 

<212> DNA 

<213> artificial sequence 
<220> 

<223> NF-KB element 

<400> 1 
gggaatttcc gggaatttcc gggaatttcc gggaatttcc 



<210> 2 

<211> 43 

<212> DNA 

<213> artificial sequence 
<220> 

<223> heat responsive element (HRE) 

<400> 2 

tagaatgttc tagatctaga acattctagc tagaatgttc tag 43 

<210> 3 

<211> 36 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Myc element

<400> 3

cacgtgcacg tgcacgtgca cgtgcacgtg cacgtg 36



<210> 4 

<211> 157 

<212> DNA 

<213> artificial sequence

<220>

<223> p53 cis element

<400> 4

tcgagcttgc ctggacttgc ctgggccaga tctgtcgacg gaggtcgagc 50
ttgcctggac ttgcctgccc acatctgtcg acggaggtcg agcttgcctg 100
gacttgcctg cccacatctg tcgacggagg tcgagccttg cctggacttg 150
cctgcca 157



<210> 5

<211> 90

<212> DNA

<213> artificial sequence

<220>

<223> nuclear factor of activated T cells

<400> 5

ggaggaaaaa ctgtttcata cagaaggcgt ggaggaaaaa ctgtttcata 50
cagaaggcgt ggaggaaaaa ctgtttcata cagaaggcgt 90



<210> 6 

<211> 46 

<212> DNA 

<213> artificial sequence 
<220> 

<223> alkaline phosphatase 1 (AP1)

<400> 6 

tgagtcagtg agtcactgac tcactgactc atgagtcagc tgactc 46 



<210> 7

<211> 60

<212> DNA

<213> artificial sequence

<220>

<223> serum responsive element (SRE)

<400> 7

gatgtccata ttaggacatc gatgtccgaa tatggacatc gatgtccata 50
ttaggacatc 60



<210> 8 

<211> 45 

<212> DNA 

<213> artificial sequence 
<220> 

<223> glucocorticoid responsive element (GRE) 

<400> 8 

ggtacatttt gttctagaac aaaatgtacc ggtacatttt gttct 45 



<210> 9 
<211> 87 
<212> DNA 

<213> artificial sequence 

<220> 

<223> cAMP responsive element (CRE3) 

<400> 9 

gcaccagaca gtgacgtcag ctgccagatc ccatggccgt catactgtga 50 
cgtctttcag acaccccatt gacgtcaatg ggagaac 87 



<210> 10 

<211> 130

<212> DNA 

<213> artificial sequence 

<220> 

<223> cAMP responsive element (CRE5) 

<400> 10 

tcgagcccat ggccgtcata ctgtgacgtc cccatggccg tcatactgtg 50 
acgtccccat ggccgtcata ctgtgacgtc cccatggtcg tcatactgtg 100 
acgtccccat ggccgtcata ctgtgacgtc 130



<210> 11

<211> 4846

<212> DNA

<213> artificial sequence

<220>

<223> synthesized pNF-KB-SEAP construct

<400> 11

ggtaccgagc tcttacgcgt gctagcggga atttccggga atttccggga 50
atttccggga atttccagat ctgccgcccc gactgcatct gcgtgttcga 100
attcgccaat gacaagacgc tgggcggggt ttgtgtcatc atagaactaa 150
agacatgcaa atatatttct tccggggaca ccgccagcaa acgcgagcaa 200
cgggccacgg ggatgaagca gaagcttcga atcgcgaatt cgcccaccat 250
gctgctgctg ctgctgctgc tgggcctgag gctacagctc tccctgggca 300
tcatcccagt tgaggaggag aacccggact tctggaaccg cgaggcagcc 350
gaggccctgg gtgccgccaa gaagctgcag cctgcacaga cagccgccaa 400
gaacctcatc atcttcctgg gcgatgggat gggggtgtct acggtgacag 450
ctgccaggat cctaaaaggg cagaagaagg acaaactggg gcctgagata 500
cccctggcca tggaccgctt cccatatgtg gctctgtcca agacatacaa 550
tgtagacaaa catgtgccag acagtggagc cacagccacg gcctacctgt 600
gcggggtcaa gggcaacttc cagaccattg gcttgagtgc agccgcccgc 650
tttaaccagt gcaacacgac acgcggcaac gaggtcatct ccgtgatgaa 700
tcgggccaag aaagcaggga agtcagtggg agtggtaacc accacacgag 750
tgcagcacgc ctcgccagcc ggcacctacg cccacacggt gaaccgcaac 800
tggtactcgg acgccgacgt gcctgcctcg gcccgccagg aggggtgcca 850
ggacatcgct acgcagctca tctccaacat ggacattgac gtgatcctag 900
gtggaggccg aaagtacatg tttcgcatgg gaaccccaga ccctgagtac 950
ccagatgact acagccaagg tgggaccagg ctggacggga agaatctggt 1000
gcaggaatgg ctggcgaagc gccagggtgc ccggtatgtg tggaaccgca 1050
ctgagctcat gcaggcttcc ctggacccgt ctgtgaccca tctcatgggt 1100
ctctttgagc ctggagacat gaaatacgag atccaccgag actccacact 1150
ggacccctcc ctgatggaga tgacagaggc tgccctgcgc ctgctgagca 1200
ggaacccccg cggcttcttc ctcttcgtgg agggtggtcg catcgaccat 1250
ggtcatcatg aaagcagggc ttaccgggca ctgactgaga cgatcatgtt 1300
cgacgacgcc attgagaggg cgggccagct caccagcgag gaggacacgc 1350
tgagcctcgt cactgccgac cactcccacg tcttctcctt cggaggctac 1400
cccctgcgag ggagctccat cttcgggctg gcccctggca aggcccggga 1450
caggaaggcc tacacggtcc tcctatacgg aaacggtcca ggctatgtgc 1500
tcaaggacgg cgcccggccg gatgttaccg agagcgagag cgggagcccc 1550
gagtatcggc agcagtcagc agtgcccctg gacgaagaga cccacgcagg 1600
cgaggacgtg gcggtgttcg cgcgcggccc gcaggcgcac ctggttcacg 1650
gcgtgcagga gcagaccttc atagcgcacg tcatggcctt cgccgcctgc 1700
ctggagccct acaccgcctg cgacctggcg ccccccgccg gcaccaccga 1750
cgccgcgcac ccgggttact ctagagtcgg ggcggccggc cgcttcgagc 1800
agacatgata agatacattg atgagtttgg acaaaccaca actagaatgc 1850
agtgaaaaaa atgctttatt tgtgaaattt gtgatgctat tgctttattt 1900
gtaaccatta taagctgcaa taaacaagtt aacaacaaca attgcattca 1950
ttttatgttt caggttcagg gggaggtgtg ggaggttttt taaagcaagt 2000
aaaacctcta caaatgtggt aaaatcgata aggatccgtc gaccgatgcc 2050
cttgagagcc ttcaacccag tcagctcctt ccggtgggcg cggggcatga 2100
ctatcgtcgc cgcacttatg actgtcttct ttatcatgca actcgtagga 2150
caggtgccgg cagcgctctt ccgcttcctc gctcactgac tcgctgcgct 2200
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata 2250
cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 2300
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt 2350
tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt 2400
cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc 2450
tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 2500
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca 2550
cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca agctgggctg 2600
tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta tccggtaact 2650
atcgtcttga gtccaacccg gtaagacacg acttatcgcc actggcagca 2700
gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga 2750
gttcttgaag tggtggccta actacggcta cactagaagg acagtatttg 2800
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc 2850
tcttgatccg gcaaacaaac caccgctggt agcggtggtt tttttgtttg 2900
caagcagcag attacgcgca gaaaaaaagg atctcaagaa gatcctttga 2950
tcttttctac ggggtctgac gctcagtgga acgaaaactc acgttaaggg 3000
attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa 3050
ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 3100
ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt 3150
ctatttcgtt catccatagt tgcctgactc cccgtcgtgt agataactac 3200
gatacgggag ggcttaccat ctggccccag tgctgcaatg ataccgcgag 3250
acccacgctc accggctcca gatttatcag caataaacca gccagccgga 3300
agggccgagc gcagaagtgg tcctgcaact ttatccgcct ccatccagtc 3350
tattaattgt tgccgggaag ctagagtaag tagttcgcca gttaatagtt 3400
tgcgcaacgt tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg 3450
tttggtatgg cttcattcag ctccggttcc caacgatcaa ggcgagttac 3500
atgatccccc atgttgtgca aaaaagcggt tagctccttc ggtcctccga 3550
tcgttgtcag aagtaagttg gccgcagtgt tatcactcat ggttatggca 3600
gcactgcata attctcttac tgtcatgcca tccgtaagat gcttttctgt 3650
gactggtgag tactcaacca agtcattctg agaatagtgt atgcggcgac 3700
cgagttgctc ttgcccggcg tcaatacggg ataataccgc gccacatagc 3750
agaactttaa aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact 3800
ctcaaggatc ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg 3850
cacccaactg atcttcagca tcttttactt tcaccagcgt ttctgggtga 3900
gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataa gggcgacacg 3950
gaaatgttga atactcatac tcttcctttt tcaatattat tgaagcattt 4000
atcagggtta ttgtctcatg agcggataca tatttgaatg tatttagaaa 4050
aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga 4100
cgcgccctgt agcggcgcat taagcgcggc gggtgtggtg gttacgcgca 4150
gcgtgaccgc tacacttgcc agcgccctag cgcccgctcc tttcgctttc 4200
ttcccttcct ttctcgccac gttcgccggc tttccccgtc aagctctaaa 4250
tcgggggctc cctttagggt tccgatttag tgctttacgg cacctcgacc 4300
ccaaaaaact tgattagggt gatggttcac gtagtgggcc atcgccctga 4350
tagacggttt ttcgcccttt gacgttggag tccacgttct ttaatagtgg 4400
actcttgttc caaactggaa caacactcaa ccctatctcg gtctattctt 4450
ttgatttata agggattttg ccgatttcgg cctattggtt aaaaaatgag 4500
ctgatttaac aaaaatttaa cgcgaatttt aacaaaatat taacgtttac 4550
aatttcccat tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg 4600
tgcgggcctc ttcgctatta cgccagccca agctaccatg ataagtaagt 4650
aatattaagg tacgggaggt acttggagcg gccgcaataa aatatcttta 4700
ttttcattac atctgtgtgt tggttttttg tgtgaatcga tagtactaac 4750
atacgctctc catcaaaaca aaacgaaaca aaacaaacta gcaaaatagg 4800
ctgtccccag tgcaagtgca ggtgccagaa catttctcta tcgata 4846


<210> 12

<211> 4163

<212> DNA

<213> artificial sequence

<220>

<223> synthesized pNF-KB-d2EGFP construct

<400> 12

ggtaccgagc tcttacgcgt gctagcggga atttccggga atttccggga 50
atttccggga atttccagat ctgccgcccc gactgcatct gcgtgttcga 100
attcgccaat gacaagacgc tggacggggt ttgtgtcatc atagaactaa 150
agacatgcaa atatatttct tccggggaca ccgccagcaa acgcgagcaa 200
cgggccacgg ggatgaagca gaagcttccc cgaattccgg atggtgagca 250
agggcgagga gctgttcacc ggggtggtgc ccatcctggt cgagctggac 300
ggcgacgtaa acggccacaa gttcagcgtg tccggcgagg gcgagggcga 350
tgccacctac ggcaagctga ccctgaagtt catctgcacc accggcaagc 400
tgcccgtgcc ctggcccacc ctcgtgacca ccctgaccta cggcgtgcag 450
tgcttcagcc gctaccccga ccacatgaag cagcacgact tcttcaagtc 500
cgccatgccc gaaggctacg tccaggagcg caccatcttc ttcaaggacg 550
acggcaacta caagacccgc gccgaggtga agttcgaggg cgacaccctg 600
gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg acggcaacat 650
cctggggcac aagctggagt acaactacaa cagccacaac gtctatatca 700
tggccgacaa gcagaagaac ggcatcaagg tgaacttcaa gatccgccac 750
aacatcgagg acggcagcgt acagctcgcc gaccactacc agcagaacac 800
ccccatcggc gacggccccg tgctgctgcc cgacaaccac tacctgagca 850
cccagtccgc cctgagcaaa gaccccaacg agaagcgcga tcacatggtc 900
ctgctggagt tcgtgaccgc cgccgggatc actctcggca tggacgagct 950
gtacaagaag cttagccatg gcttcccgcc ggaggtggag gagcaggatg 1000
atggcacgct gcccatgtct tgtgcccagg agagcgggat ggaccgtcac 1050
cctgcagcct gtgcttctgc taggatcaat gtgtagtcta gagtcggggc 1100
ggccggccgc ttcgagcaga catgataaga tacattgatg agtttggaca 1150
aaccacaact agaatgcagt gaaaaaaatg ctttatttgt gaaatttgtg 1200
atgctattgc tttatttgta accattataa gctgcaataa acaagttaac 1250
aacaacaatt gcattcattt tatgtttcag gttcaggggg aggtgtggga 1300
ggttttttaa agcaagtaaa acctctacaa atgtggtaaa atcgataagg 1350
atccgtcgac cgatgccctt gagagccttc aacccagtca gctccttccg 1400
gtgggcgcgg ggcatgacta tcgtcgccgc acttatgact gtcttcttta 1450
tcatgcaact cgtaggacag gtgccggcag cgctcttccg cttcctcgct 1500
cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc 1550
actcaaaggc ggtaatacgg ttatccacag aatcagggga taacgcagga 1600
aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc 1650
cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca 1700
aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga 1750
taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac 1800
cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg 1850
cgctttctca tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt 1900
cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg 1950
cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact 2000
tatcgccact ggcagcagcc actggtaaca ggattagcag agcgaggtat 2050
gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac 2100
tagaaggaca gtatttggta tctgcgctct gctgaagcca gttaccttcg 2150
gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc 2200
ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc 2250
tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg 2300
aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc 2350
acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat 2400
atatgagtaa acttggtctg acagttacca atgcttaatc agtgaggcac 2450
ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc 2500
gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc 2550
tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa 2600
taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta 2650
tccgcctcca tccagtctat taattgttgc cgggaagcta gagtaagtag 2700
ttcgccagtt aatagtttgc gcaacgttgt tgccattgct acaggcatcg 2750
tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa 2800
cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag 2850
ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat 2900
cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc 2950
gtaagatgct tttctgtgac tggtgagtac tcaaccaagt cattctgaga 3000
atagtgtatg cggcgaccga gttgctcttg cccggcgtca atacgggata 3050
ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt 3100
tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc 3150
gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca 3200
ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag 3250
ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca 3300
atattattga agcatttatc agggttattg tctcatgagc ggatacatat 3350
ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc 3400
cgaaaagtgc cacctgacgc gccctgtagc ggcgcattaa gcgcggcggg 3450
tgtggtggtt acgcgcagcg tgaccgctac acttgccagc gccctagcgc 3500
ccgctccttt cgctttcttc ccttcctttc tcgccacgtt cgccggcttt 3550
ccccgtcaag ctctaaatcg ggggctccct ttagggttcc gatttagtgc 3600
tttacggcac ctcgacccca aaaaacttga ttagggtgat ggttcacgta 3650
gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc 3700
acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc 3750
tatctcggtc tattcttttg atttataagg gattttgccg atttcggcct 3800
attggttaaa aaatgagctg atttaacaaa aatttaacgc gaattttaac 3850
aaaatattaa cgtttacaat ttcccattcg ccattcaggc tgcgcaactg 3900
ttgggaaggg cgatcggtgc gggcctcttc gctattacgc cagcccaagc 3950
taccatgata agtaagtaat attaaggtac gggaggtact tggagcggcc 4000
gcaataaaat atctttattt tcattacatc tgtgtgttgg ttttttgtgt 4050
gaatcgatag tactaacata cgctctccat caaaacaaaa cgaaacaaaa 4100
caaactagca aaataggctg tccccagtgc aagtgcaggt gccagaacat 4150
ttctctatcg ata 4163



<210> 13 

<211> 4300 

<212> DNA 

<213> artificial sequence 
<220> 

<223> synthesized pCRE5-d2EGFP 



<400> 13

ggtaccgagc tcttacgcgt gctagcggga atttccggga atttccggga 50
atttccggga atttccagat cctcgagccc atggccgtca tactgtgacg 100
tccccatggc cgtcatactg tgacgtcccc atggccgtca tactgtgacg 150
tccccatggt cgtcatactg tgacgtcccc atggccgtca tactgtgacg 200
tcccgcggtc gactaccctt caatcattgg atggaatttc ctgttgatcc 250
cagggcttag atgcaggtgg aaacactctg ctggtataaa agcaggtgag 300
gacttcatta actgcagtta ctgagaactc ataagacgaa gctaaaatcc 350
ctcttcggaa gcttccccga attccggatg gtgagcaagg gcgaggagct 400
gttcaccggg gtggtgccca tcctggtcga gctggacggc gacgtaaacg 450
gccacaagtt cagcgtgtcc ggcgagggcg agggcgatgc cacctacggc 500
aagctgaccc tgaagttcat ctgcaccacc ggcaagctgc ccgtgccctg 550
gcccaccctc gtgaccaccc tgacctacgg cgtgcagtgc ttcagccgct 600
accccgacca catgaagcag cacgacttct tcaagtccgc catgcccgaa 650
ggctacgtcc aggagcgcac catcttcttc aaggacgacg gcaactacaa 700
gacccgcgcc gaggtgaagt tcgagggcga caccctggtg aaccgcatcg 750
agctgaaggg catcgacttc aaggaggacg gcaacatcct ggggcacaag 800
ctggagtaca actacaacag ccacaacgtc tatatcatgg ccgacaagca 850
gaagaacggc atcaaggtga acttcaagat ccgccacaac atcgaggacg 900
gcagcgtgca gctcgccgac cactaccagc agaacacccc catcggcgac 950
ggccccgtgc tgctgcccga caaccactac ctgagcaccc agtccgccct 1000
gagcaaagac cccaacgaga agcgcgatca catggtcctg ctggagttcg 1050
tgaccgccgc cgggatcact ctcggcatgg acgagctgta caagaagctt 1100
agccatggct tcccgccgga ggtggaggag caggatgatg gcacgctgcc 1150
catgtcttgt gcccaggaga gcgggatgga ccgtcaccct gcagcctgtg 1200
cttctgctag gatcaatgtg tagtctagag tcggggcggc cggccgcttc 1250
gagcagacat gataagatac attgatgagt ttggacaaac cacaactaga 1300
atgcagtgaa aaaaatgctt tatttgtgaa atttgtgatg ctattgcttt 1350
atttgtaacc attataagct gcaataaaca agttaacaac aacaattgca 1400
ttcattttat gtttcaggtt cagggggagg tgtgggaggt tttttaaagc 1450
aagtaaaacc tctacaaatg tggtaaaatc gataaggatc cgtcgaccga 1500
tgcccttgag agccttcaac ccagtcagct ccttccggtg ggcgcggggc 1550
atgactatcg tcgccgcact tatgactgtc ttctttatca tgcaactcgt 1600
aggacaggtg ccggcagcgc tcttccgctt cctcgctcac tgactcgctg 1650
cgctcggtcg ttcggctgcg gcgagcggta tcagctcact caaaggcggt 1700
aatacggtta tccacagaat caggggataa cgcaggaaag aacatgtgag 1750
caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg 1800
tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc 1850
aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc 1900
cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc 1950
ggatacctgt ccgcctttct cccttcggga agcgtggcgc tttctcatag 2000
ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc tccaagctgg 2050
gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt 2100
aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc 2150
agcagccact ggtaacagga ttagcagagc gaggtatgta ggcggtgcta 2200
cagagttctt gaagtggtgg cctaactacg gctacactag aaggacagta 2250
tttggtatct gcgctctgct gaagccagtt accttcggaa aaagagttgg 2300
tagctcttga tccggcaaac aaaccaccgc tggtagcggt ggtttttttg 2350
tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct 2400
ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta 2450
agggattttg gtcatgagat tatcaaaaag gatcttcacc tagatccttt 2500
taaattaaaa atgaagtttt aaatcaatct aaagtatata tgagtaaact 2550
tggtctgaca gttaccaatg cttaatcagt gaggcaccta tctcagcgat 2600
ctgtctattt cgttcatcca tagttgcctg actccccgtc gtgtagataa 2650
ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg 2700
cgagacccac gctcaccggc tccagattta tcagcaataa accagccagc 2750
cggaagggcc gagcgcagaa gtggtcctgc aactttatcc gcctccatcc 2800
agtctattaa ttgttgccgg gaagctagag taagtagttc gccagttaat 2850
agtttgcgca acgttgttgc cattgctaca ggcatcgtgg tgtcacgctc 2900
gtcgtttggt atggcttcat tcagctccgg ttcccaacga tcaaggcgag 2950
ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct 3000
ccgatcgttg tcagaagtaa gttggccgca gtgttatcac tcatggttat 3050
ggcagcactg cataattctc ttactgtcat gccatccgta agatgctttt 3100
ctgtgactgg tgagtactca accaagtcat tctgagaata gtgtatgcgg 3150
cgaccgagtt gctcttgccc ggcgtcaata cgggataata ccgcgccaca 3200
tagcagaact ttaaaagtgc tcatcattgg aaaacgttct tcggggcgaa 3250
aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact 3300
cgtgcaccca actgatcttc agcatctttt actttcacca gcgtttctgg 3350
gtgagcaaaa acaggaaggc aaaatgccgc aaaaaaggga ataagggcga 3400
cacggaaatg ttgaatactc atactcttcc tttttcaata ttattgaagc 3450
atttatcagg gttattgtct catgagcgga tacatatttg aatgtattta 3500
gaaaaataaa caaatagggg ttccgcgcac atttccccga aaagtgccac 3550
ctgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg 3600
cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc 3650
tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc 3700
taaatcgggg gctcccttta gggttccgat ttagtgcttt acggcacctc 3750
gaccccaaaa aacttgatta gggtgatggt tcacgtagtg ggccatcgcc 3800
ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg ttctttaata 3850
gtggactctt gttccaaact ggaacaacac tcaaccctat ctcggtctat 3900
tcttttgatt tataagggat tttgccgatt tcggcctatt ggttaaaaaa 3950
tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgt 4000
ttacaatttc ccattcgcca ttcaggctgc gcaactgttg ggaagggcga 4050
tcggtgcggg cctcttcgct attacgccag cccaagctac catgataagt 4100
aagtaatatt aaggtacggg aggtacttgg agcggccgca ataaaatatc 4150
tttattttca ttacatctgt gtgttggttt tttgtgtgaa tcgatagtac 4200
taacatacgc tctccatcaa aacaaaacga aacaaaacaa actagcaaaa 4250
taggctgtcc ccagtgcaag tgcaggtgcc agaacatttc tctatcgata 4300



<210> 14 
<211> 4987 
<212> DNA 

<213> artificial sequence 

<220> 

<223> synthesized pNF-KB-Luc 

<400> 14 

ggtaccgagc tcttacgcgt gctagcggga atttccggga atttccggga 50 
atttccggga atttccagat ctgccgcccc gactgcatct gcgtgttcga 100 
attcgccaat gacaagacgc tgggcggggt ttgtgtcatc atagaactaa 150 
agacatgcaa atatatttct tccggggaca ccgccagcaa acgcgagcaa 200
cgggccacgg ggatgaagca gaagcttggc attccggtac tgttggtaaa 250
gccaccatgg aagacgccaa aaacataaag aaaggcccgg cgccattcta 300 
tccgctggaa gatggaaccg ctggagagca actgcataag gctatgaaga 350 
gatacgccct ggttcctgga acaattgctt ttacagatgc acatatcgag 400 
gtggacatca cttacgctga gtacttcgaa atgtccgttc ggttggcaga 450 
agctatgaaa cgatatgggc tgaatacaaa tcacagaatc gtcgtatgca 500 
gtgaaaactc tcttcaattc tttatgccgg tgttgggcgc gttatttatc 550 
ggagttgcag ttgcgcccgc gaacgacatt tataatgaac gtgaattgct 600 
caacagtatg ggcatttcgc agcctaccgt ggtgttcgtt tccaaaaagg 650 
ggttgcaaaa aattttgaac gtgcaaaaaa agctcccaat catccaaaaa 700 
attattatca tggattctaa aacggattac cagggatttc agtcgatgta 750 
cacgttcgtc acatctcatc tacctcccgg ttttaatgaa tacgattttg 800 
tgccagagtc cttcgatagg gacaagacaa ttgcactgat catgaactcc 850 
tctggatcta ctggtctgcc taaaggtgtc gctctgcctc atagaactgc 900 
ctgcgtgaga ttctcgcatg ccagagatcc tatttttggc aatcaaatca 950 
ttccggatac tgcgatttta agtgttgttc cattccatca cggttttgga 1000 
atgtttacta cactcggata tttgatatgt ggatttcgag tcgtcttaat 1050 
gtatagattt gaagaagagc tgtttctgag gagccttcag gattacaaga 1100 
ttcaaagtgc gctgctggtg ccaaccctat tctccttctt cgccaaaagc 1150 
actctgattg acaaatacga tttatctaat ttacacgaaa ttgcttctgg 1200 
tggcgctccc ctctctaagg aagtcgggga agcggttgcc aagaggttcc 1250 
atctgccagg tatcaggcaa ggatatgggc tcactgagac tacatcagct 1300 
attctgatta cacccgaggg ggatgataaa ccgggcgcgg tcggtaaagt 1350 
tgttccattt tttgaagcga aggttgtgga tctggatacc gggaaaacgc 1400 
tgggcgttaa tcaaagaggc gaactgtgtg tgagaggtcc tatgattatg 1450 
tccggttatg taaacaatcc ggaagcgacc aacgccttga ttgacaagga 1500 
tggatggcta cattctggag acatagctta ctgggacgaa gacgaacact 1550 
tcttcatcgt tgaccgcctg aagtctctga ttaagtacaa aggctatcag 1600 
gtggctcccg ctgaattgga atccatcttg ctccaacacc ccaacatctt 1650 
cgacgcaggt gtcgcaggtc ttcccgacga tgacgccggt gaacttcccg 1700 
ccgccgttgt tgttttggag cacggaaaga cgatgacgga aaaagagatc 1750 
gtggattacg tcgccagtca agtaacaacc gcgaaaaagt tgcgcggagg 1800 
agttgtgttt gtggacgaag taccgaaagg tcttaccgga aaactcgacg 1850 
caagaaaaat cagagagatc ctcataaagg ccaagaaggg cggaaagatc 1900 
gccgtgtaat tctagagtcg gggcggccgg ccgcttcgag cagacatgat 1950 
aagatacatt gatgagtttg gacaaaccac aactagaatg cagtgaaaaa 2000 
aatgctttat ttgtgaaatt tgtgatgcta ttgctttatt tgtaaccatt 2050 
ataagctgca ataaacaagt taacaacaac aattgcattc attttatgtt 2100
tcaggttcag ggggaggtgt gggaggtttt ttaaagcaag taaaacctct 2150
acaaatgtgg taaaatcgat aaggatccgt cgaccgatgc ccttgagagc 2200
cttcaaccca gtcagctcct tccggtgggc gcggggcatg actatcgtcg 2250
ccgcacttat gactgtcttc tttatcatgc aactcgtagg acaggtgccg 2300
gcagcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc 2350
ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc 2400
acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca 2450
aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc 2500
tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg 2550
cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc 2600
cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg 2650
cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg 2700
tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga 2750
accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg 2800
agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt 2850
aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa 2900
gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg 2950
ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc 3000
ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca 3050
gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta 3100
cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc 3150
atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg 3200
aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt 3250
accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt 3300
tcatccatag ttgcctgact ccccgtcgtg tagataacta cgatacggga 3350
gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct 3400
caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag 3450
cgcagaagtg gtcctgcaac tttatccgcc tccatccagt ctattaattg 3500
ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg 3550
ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg 3600
gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc 3650
catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca 3700
gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat 3750
aattctctta ctgtcatgcc atccgtaaga tgcttttctg tgactggtga 3800
gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct 3850
cttgcccggc gtcaatacgg gataataccg cgccacatag cagaacttta 3900
aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat 3950
cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact 4000
gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca 4050
ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg 4100
aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt 4150
attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa 4200
ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgcgccctg 4250
tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg 4300
ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc 4350
tttctcgcca cgttcgccgg ctttccccgt caagctctaa atcgggggct 4400
ccctttaggg ttccgattta gtgctttacg gcacctcgac cccaaaaaac 4450
ttgattaggg tgatggttca cgtagtgggc catcgccctg atagacggtt 4500
tttcgccctt tgacgttgga gtccacgttc tttaatagtg gactcttgtt 4550
ccaaactgga acaacactca accctatctc ggtctattct tttgatttat 4600
aagggatttt gccgatttcg gcctattggt taaaaaatga gctgatttaa 4650
caaaaattta acgcgaattt taacaaaata ttaacgttta caatttccca 4700
ttcgccattc aggctgcgca actgttggga agggcgatcg gtgcgggcct 4750
cttcgctatt acgccagccc aagctaccat gataagtaag taatattaag 4800
gtacgggagg tacttggagc ggccgcaata aaatatcttt attttcatta 4850
catctgtgtg ttggtttttt gtgtgaatcg atagtactaa catacgctct 4900
ccatcaaaac aaaacgaaac aaaacaaact agcaaaatag gctgtcccca 4950
gtgcaagtgc aggtgccaga acatttctct atcgata 4987
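
Read directly from the listing, SEQ ID No: 1 and SEQ ID No: 3 appear to be tandem repeats of a short motif, and each entry declares its length in the <211> field. The following is a minimal sketch of how such entries could be checked for length and repeat count. The helper names and the choice of repeat unit are assumptions made for illustration and are not part of the specification.

```python
def clean(seq):
    """Strip the spacing used in the listing and normalise case."""
    return "".join(seq.split()).lower()


def tandem_copies(seq, motif):
    """Return how many back-to-back copies of `motif` make up `seq`,
    or None if `seq` is not an exact tandem repeat of `motif`."""
    seq, motif = clean(seq), clean(motif)
    copies, remainder = divmod(len(seq), len(motif))
    if remainder == 0 and seq == motif * copies:
        return copies
    return None


# Sequences and declared lengths copied from the listing above.
SEQ_1 = "gggaatttcc gggaatttcc gggaatttcc gggaatttcc"  # <211> 40
SEQ_3 = "cacgtgcacg tgcacgtgca cgtgcacgtg cacgtg"       # <211> 36

if __name__ == "__main__":
    # The repeat units below are inferred by inspection (an assumption).
    print(len(clean(SEQ_1)), tandem_copies(SEQ_1, "gggaatttcc"))
    print(len(clean(SEQ_3)), tandem_copies(SEQ_3, "cacgtg"))
```

The same length check can be applied to any entry in the listing by comparing `len(clean(...))` with the value given in its <211> field.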